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ABSTRACT 


A 

A  new  adaptive  processing  algorithm,  called  Weighted  Covariance  Estima¬ 
tion  (WCE) ,  for  the  detection  of  signals  in  inteiference  of  unknown  character 
is  presented.  Its  main  advantage  over  present  techniques,  such  as  Sample 
Matrix  Inversion  (SMI) ,  is  its  tendency  not  to  suppress  desired  signals 
present  in  the  learning  data. 

WCE  and  SMI  are  compared  for  a  radar  problem  of  practical  interest, 
adaptive  MTI  from  a  moving  platform,  using  both  simulated  and  actual  radar 
data. 
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INTRODUCTION 

Adaptive  array  processing ^  is  a  powerful  technique  for  detecting  known 
signals  immersed  in  a  Gaussian  interference  background.  The  statistics  of 
Gaussian  interference  are  completely  characterized  by  a  covariance  matrix 
which,  together  with  the  signal,  determines  the  optimum  receiver  structure. 

If  the  covariance  matrix  is  not  known,  it  must  be  estimated  from  the  available 
observations. 

The  problem  of  interest  here  is  the  adaptive  detection  of  radar  returns 

from  moving  targets  in  the  presence  of  ground  clutter  and  receiver  noise.  The 

radar  return  from  each  transmitted  signal  is  received  by  M  sensors  and  matched- 

filtered.  The  resulting  outputs  are  sampled  at  the  Nyquist  rate,  yielding 

complex  samples  in  N  range  cells  from  each  sensor.  One  (vector)  observation 

consists  of  the  M  complex  samples  from  a  particular  range  cell.  To  obtain  the 

necessary  number  of  independent  observations  of  the  interference  process,  it 

is  commonly  assumed  that  the  interference  covariance  matrix  is  the  same  in 

every  range  cell.  The  maximum  likelihood  estimate  of  the  covariance  matrix  is 

[2] 

then  the  sample  covariance  matrix  ,  which  is  inverted  and  used  to  determine 
the  optimum  receiver  weighting ^ 3 ^ .  This  technique  has  been  called  Sample 
Matrix  Inversion  (SMI) . 

This  paper  proposes  a  different  model  for  the  interference  statistics  and 
derives  the  maximum  likelihood  estimate  of  the  interference  covariance  matrix. 
This  estimate  is  more  expensive  computationally  than  the  sample  covariance, 
but  offers  greater  immunity  to  the  suppression  of  desired  signals. 

The  performance  of  the  algorithm  is  compared  with  that  of  SMI  using  both 
simulated  data  and  real  data  from  an  airborne  MTI  radar. 

Adaptive  Detection 

The  problem  of  detecting  a  (complex)  signal  in  colored  Gaussian  inter¬ 
ference  given  K  (scalar)  complex  observations  may  be  stated  as  follows: 

Under  hypothesis  Hq  (no  signal  present) ,  the  observations  are  of  the  form 

H0!  Zk  =  \  k  =  1,  .  .  .K 


1 


while  under  hypothesis  (signal  presentl  they  take  the  form 

Hr  zk  =  6sk(^  +  \ 


Here  0  is  an  unknown  complex  number,  acknowledging  the  fact  that  the 
amplitude  and  phase  of  the  signal  are  unknown.  Any  additional  unknown  param¬ 
eters  are  contained  in  the  vector  argument  a.  Under  the  Gaussian  assumption, 

the  interference  is  completely  characterized  by  its  first-  and  second-order 

.  .  + 

statistics 


E(nk)  =  =  0 


E(nkV 


A 


ki, 


The  detection  strategy  is  based  on  the  generalized  likelihood  ratio  test 


JUz) 


max  p(z|0,a,H  ) 
B,a 


P(z|hq) 


(1) 

* 

Thus  each  observation  z^  is  multiplied  by  a  complex  weight  w^,  the  resulting 
complex  numbers  are  summed,  and  the  magnitude  of  the  resultant  is  compared 
with  a  threshold.  The  weighting  vector  w  is  given  by 

w  =  A  1£(a) 

t  *  denotes  the  complex  conjugate  of  a  scalar  and  the  complex  conjugate 
transpose  of  a  vector  or  matrix. 


It  can  be  shown  that  the  test  reduces  to 

H1 

■  *  -1  i  > 

max  s  (a) A  z  <  X' 
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where  a  is  the  vector  of  parameter  values  that  maximizes  (1) . 

When  some  or  all  of  the  elements  of  A  are  unknown,  the  test  becomes 


max  p(zjB,a,A  ,H  ) 

8, a, A  U 

0(z)  =  _ H _ _ _ 

—  max  p (z I A  ,H  ) 

—  u  o 

A 

u 

where  A^  denotes  the  set  of  unknown  elements  of  A.  Evidently,  if  the  number 
of  (real)  unknown  parameters  exceeds  the  number  of  (real)  observations,  the 
observation  space  cannot  be  mapped  1:1  into  the  parameter  space.  Additional 
independent  observations  are  required  in  order  to  form  meaningful  estimates. 

In  most  practical  detection  problems,  the  presence  of  a  signal  is  an 
unlikely  event.  Multiple  independent  observations  under  are  thus  not 
available.  On  the  other  hand,  the  interference  environment  is  often  statisti¬ 
cally  stationary  (or  quasi-stationary)  in  time  and/or  space,  so  that  addi¬ 
tional  independent  observations  under  Hq  (no  signal)  are  easily  obtained.  In 
such  cases,  it  is  reasonable  to  estimate  interference  statistics  assuming  that 
Hq  is  true,  and  use  the  results  to  estimate  signal  parameters. 

Specifically,  if  N  independent  observations  z_  ,  . . .z  are  available  under 

a  -L  “lN 

H  ,  find  the  maximum  likelihood  estimate  A  of  all  the  unknown  statistical 
o  u 

parameters.  Given  another  independent  observation  z.,  , ,  use 

— N+l 


max  1  3, a, A  ,H_) 

„  ~Nl  -  u  1 

P  /  cL 


p(wvho> 


as  the  detection  statistic  to  decide  whether  or  not  a  signal  is  present  in 

z„,,.  In  reality,  of  course,  any  of  the  z  may  contain  a  signal,  so  the 
—N+l  — n 

estimate  of  the  interference  statistics  is  based  on  all  N+l  observations,  and 
used  to  test  each  observation  for  the  presence  of  a  signal.  The  hope  is  that 
the  presence  of  a  few  signals  in  the  N+l  observations  will  not  seriously  per¬ 
turb  the  estimates  of  interference  statistics. 
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The  Sample  (covariance)  Matrix  Inversion  (SMI)  technique is  a  special 

case  of  this  adaptive  detection  strategy  in  which  the  entire  interference 

covariance  matrix  is  unknown,  but  assumed  constant  for  all  observations. 

[2] 

Goodman  shows  that  the  maximum  likelihood  estimate  of  the  covariance  matrix 
is  the  sample  covariance  matrix 


A 


1 

N 


N 


£ 


z  z 
— n— n 


and  derives  the  joint  characteristic  function  and  probability  density  of  its 
N2  real  random  variables. 

This  paper  proposes  an  alternative  model  in  which  the  interference  co- 
variance  matrix  for  the  n  independent  observation  is  of  the  form 


A 


n 


o2C 

n 


(2) 


2 

where  the  variances  {a  ,  n=l,  ..N)  and  the  elements  of  C  are  unknown. 

n 

This  form  of  the  covariance  matrix  is  suggested  by  the  airborne  multi¬ 
antenna  radar  problem.  The  assumption  of  a  single  interference  covariance 
matrix  common  to  all  range  cells  does  not  seem  appropriate,  since  clutter 
reflectivity  often  varies  significantly  from  range  cell  to  range  cell.  However, 
some  sort  of  statistical  regularity  across  range  cells  is  needed;  otherwise, 
there  are  too  many  parameters  to  be  estimated. 

The  derivation  of  the  clutter  covariance  matrix  for  this  problem  is 
presented  in  Appendix  A  in  order  to  highlight  the  many  assumptions  and  approxi¬ 
mations  which  are  involved.  The  result  is  that  the  clutter  covariance  matrix 
has  the  form  given  in  (2) .  The  covariance  matrix  of  interest  is  actually  that 
of  the  interference,  which  consists  of  both  clutter  and  thermal  noise.  This 
matrix  takes  the  form 

2  2 

A  =  o  C  +  a  I 
n  n  o 

2 

where  0q  is  the  thermal  noise  power  per  complex  sample.  The  covariance  model 
used  in  this  paper  is  valid  only  if  the  clutter  return  is  much  larger  than 
thermal  noise. 


* 
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Derivation  of  ML  Covariance  Estimate 

The  joint  probability  density  of  N  independent  vector  observations  of 
interference,  each  having  M  complex  elements,  is 


N 

n 

n=l 


M  2M  | 
it  a 

n  1 


exp  <- 


z 

— n 


C_1z 

—n 


The  ML  estimates  maximize  this  expression ,  or  equivalently  its  logarithm, 

2  1  *  —1 

i  =  in  p(z.,..z  )  =  -MNinir  -MYjino  -Nfcnjcl  -T'— -r  z  C  z  (3) 

—1  — N  n  ‘  1  2  — n  — n 

a 

n 

2 

with  respect  to  {a  >0,  n=l,...N)  _and  the  matrix  C.  This  maximization  will 
n  2 

first  be  performed  with  respect  to  ,  with  all  other  parameters  held  fixed. 
The  log  likelihood  ratio  (LLR)  can  be  written  as 


i  -  -MNAmr  -N«.n|c|  +  ]£ 


- 

* 

1  1 

_  _1_  *  -1  i 

l 

2M  6XP  | 

2 

O 

.  o 

n 

'  n 
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Each  term  in  the  sum  has  the  form 


.  .  M  -ax. 

i n(x  e  ) 


M 

which  has  a  unique  maximum  at  x  =  — .  Thus 


"  2  1  *  -1 
a  =  -  z  c  z 
n  M  — n  — n 


is  the  ML  estimate  of  a  ,  if  C  is  known. 

n 


a,x  >  0 


Consider  next  the  maximization  with  respect  to  the  M  x  M  matrix  C,  with 
2  [2] 

the  0^  fixed.  Following  Goodman  ,  the  LLR  can  be  written 
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I  =  constant  -NJin  C  -NTr{C  S} 


,  z  z 
1  t-'— n— n 


where  S  =  -  Goodman's  proof  can  then  be  applied  unchanged  to  estab¬ 


P  *-»  2 

o 

n 


lish  that 


C  =  S 

2 

is  the  maximum  likelihood  estimate  of  C,  provided  the  o  are  known. 

2  n 

In  fact,  both  C  and  the  are  unknown.  Consequently,  the  ML  estimates 
must  be  obtained  as  the  simultaneous  solution  of  the  equations 


=  iy: 

N 


-  2  i  *:-i 
a  =  —  z  C  z 
n  M  — n  — n 


In  general,  the  solution  must  be  obtained  numerically,  by  iteration. 
Start  the  iteration  by  assuming 


CQ  “  N  23- 


choosing  the  real  constant  b  so  that  (C  =  1.  Invert  C  and  compute  the 
a  2  oil  o  „ 

o  according  to  (5),  use  these  to  compute  a  better  estimate  C, ,  rescale  it, 
n  I 

and  continue  until  convergence,  as  indicated  by  a  sufficiently  small  change  in 

2 

the  ,  is  obtained.  The  rescaling  is  possible  because  the  parameters  speci¬ 
fied  in  the  model  appear  always  as  products;  any  one  of  them  may  be  specified 

arbitrarily.  This  can  be  seen  in  the  likelihood  equations  (4),  (5)  by  the 

2  -12 
fact  that  if  C,£  constitute  a  solution,  then  so  also  do  bC,b  £  .  The  solution 

★ 

can  thus  be  rescaled  at  any  point  in  the  iteration. 

For  brevity,  this  iterative  algorithm  will  be  called  Weighted  Covariance 
Estimation  (WCE) . 

*  Alternatively,  the  superfluous  parameter  could  have  been  specified  at  the 
outset.  However,  it  is  felt  that  this  approach  results  in  a  somewhat  smoother 
exposition. 


A  disadvantage  of  the  algorithm  is  the  large  amount  of  computation  re¬ 
quired.  At  each  stage  of  the  iteration,  a  square  matrix  whose  dimension  is 
that  of  the  observation  vector  must  be  inverted. 

When  the  dimension  of  the  observation  is  2,  special  simplifications 
occur,  and  the  solution  can  be  made  more  explicit. 

Special  Case  :  M  =  2 


After  maximization  with  respect  to  C,  the  LLR  takes  the  form 


l  =  constant  -Nitnjs|  -M^Hn 

In  the  special  case  M  =  2,  the  determinant  of  S  can  be  written 


S11S22~  S12 
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v  l*J2 

ZnlZn2 

12^  2 
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2 
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Let 


C  =  , 

n  z 


n2 

nl 
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The  determinant  becomes 


|  S  |  =  Ew  Zw  I  £  I  2  -  I  Ew  £  *  I 
n  n1  n1  1  n  n  1 


-  E  E  w  w  C  -C 
2  m  n  m  n 1  m  n 1 


1  T 
j  w  RW 


where 


2 


In  terms  of  the  new  unknown  parameters  {w  }  the  LLR  is  given  by 


T  ^ 

i  =  constant  -  Nilnw  Rw  +  2V*  £nw 

—  —  n 


Setting  the  derivatives  with  respect  to  the  w  equal  to  zero  yields  the 

m 

set  of  equations 


1  T 

w  (Rw)  =  —  w  Rw 
m  —  m  N  —  — 


m  =  1, . . .N 


The  solution  is  clearly  independent  of  the  scaling  of  w.  Thus,  it  suffices 
to  seek  the  solution  of 


w  (Rw)  =  1 

m  —  m 


m  =  1, . . .N 


(7) 


and  scale  the  w's  afterward  so  that 


S11  =  C11  =X>n  =  1 


Explicit  solutions  of  (7)  have  been  obtained  for  N  <  4. 


N  =  2:  Any  two  positive  numbers. 


N  =  3: 


W1:W2:W3  =  R23:R13:R12 


N  =  4: 


WV*4  '  (E23R24I,34,1S!lR13E14E341'i!(R12R14R24,'S:(R12K13R23)'S 


For  N  >_  5,  analytic  solution  becomes  prohibitively  complicated,  and  an 
iterative  solution  is  again  required.  Assume  initially  that  all  the  w's  are 
equal  to  N-l  and  solve  equation  (7)  iteratively. 


w  (n+1) 
m 


1 

(Rw(n) ) 

—  m 


m  =  1, . .  .N  (8) 


*  In  the  case  N=2,  these  equations  degenerate  into  identities.  In  this  case, 
the  LLR  is  independent  of  the  w's.  Any  two  numbers  summing  to  1  will  serve 
equally  well. 
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Renormalize  after  each  iteration  so  that  Ew  =1,  and  continue  until  con- 

m 

vergence  is  obtained. 

Once  the  w' s  have  been  determined,  the  ML  estimate  of  the  matrix  C  can  be 
computed. 


C  =  S  = 


1  _  „ 
—  Ew  £ 

N  n  n 


1  * 

t;  Ew  C 

N  n  n 


(9) 


This  algorithm  is  considerably  simpler  than  the  one  for  general  M  because 
the  weights  are  determined  iteratively  from  the  observations,  without  matrix 
inversion,  and  the  normalized  covariance  estimate  C  need  be  computed  only 
once. 

Outlier  Rejection  Property 

The  special  form  taken  by  the  likelihood  equations  for  M  =  2  makes  possi¬ 
ble  an  interesting  heuristic  explanation  of  the  "outlier  rejection"  property 
of  the  WCE  algorithm.  This  behavior  explains  its  relative  immunity  to  corrup¬ 
tion  by  desired  signals  hidden  in  the  interference. 

The  mechanism  for  outlier  rejection  lies  in  the  iteration  (8)  for  the 
weights.  Suppose  that  the  ratios  £n  are  all  nearly  the  same,  except  for 

and  that  the  iteration  begins  with  uniform  weights  w  =  1/N.  The  elements  of 

th  ^ 

the  k  row  and  column  of  the  matrix  R  defined  by  (6)  will  be  much  larger  than 

th 

the  other  elements.  This  will  cause  the  k  element  of  the  vector  Rw  to  be 
much  larger  than  its  other  elements.  The  kfc^  weight  for  the  next  iteration  is 
the  reciprocal  of  this  element  and  so  is  much  smaller  than  the  other  weights. 
The  outlier  has  thus  been  deweighted. 


'  ■  _•  -  .X-  ■  1 
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The  presence  of  a  signal  in  a  particular  one  of  the  observations  causes 


the  ratio  for  that  observation  to  differ  from  those  of  other  observations 
containing  no  signal.  This  observation  then  contributes  little  to  the  esti¬ 
mate  of  the  covariance  matrix.  When  this  cell  is  processed  for  detection 
using  the  estimated  covariance,  improved  detection  probability  results. 

If  all  observations  are  assumed  to  be  statistically  identical,  there  is 
clearly  no  reason  to  treat  one  differently  from  any  other.  Removal  of  the 
assumption  allows  this  possibility.  The  fact  that  it  results  in  greater 
immunity  to  desired  signal  suppression  is  entirely  fortuitous,  however,  since 
no  mention  of  such  signals  was  made  in  the  model.  It  may  be  possible  to 
introduce  the  occasional  presence  of  signals  into  the  model  in  a  statistical 
fashion  and  obtain  an  algorithm  with  even  better  performance. 

It  is  conjectured  that  the  algorithm  has  similar  immunity  for  any  value 
of  M.  However,  no  plausibility  argument  has  been  found,  and  no  simulation  or 
experimental  results  are  available. 

Convergence 

The  iterative  procedure  has  been  programmed  for  the  special  case  M  =  2 
and  no  problems  with  convergence  have  been  encountered.  However,  convergence 
has  not  been  proven  theoretically. 

If  the  statistical  model  proposed  here  is  valid,  WCE  will  improve  upon 
SMI  in  the  sense  of  producing  a  larger  value  of  the  generalized  likelihood 
ratio.  The  initial  covariance  estimate  used  to  start  the  iteration  is  the  SMI 
estimate,  and  each  subsequent  iteration  step  increases  (or  at  least  does  not 
decrease)  the  likelihood  ratio.  This  is  not  inconsistent,  since  SMI  is  based 
on  a  simpler  model.  The  real  issue  is  whether  or  not  the  WCE  model  is  a 
better  representation  of  the  physical  situation  for  the  particular  problem  of 
interest. 

Accuracy  of  the  Estimator 

As  the  WCE  estimator  itself  is  defined  only  implicitly,  no  direct  analyti¬ 
cal  assessment  of  its  accuracy  seems  possible.  One  can,  however,  evaluate  the 
Cramer-Rao  bound  on  the  joint  covariance  matrix  of  the  estimation  errors  in 


all  the  unknown  parameters.  This  provides  a  lower  bound  for  the  covariance 
matrix  of  any  unbiased  estimate.  If  any  estimate  achieves  this  lower  bound, 
it  is  the  maximum  likelihood  estimate  (WCE) .  Also,  the  ML  estimate  is  asymp¬ 
totically  efficient;  that  is,  it  approaches  the  bound  asymptotically  as  the 
number  of  observations  becomes  large. 

The  calculation  of  the  Fisher  information  matrix  J  is  outlined  in  Appen¬ 
dix  B  for  the  case  M  =  2.  The  simplest  result  is  obtained  by  representing 

* 

the  unknown  covariance  matrix  C  in  the  form 


(10) 


The  errors  of  primary  interest  are  those  in  r  and  g,  which  turn  out  to  be 
uncorrelated.  Their  variances  are  bounded  by 


2 

o 

r 


(1-r2) 
-  2N 


(11) 


2 

a 
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2  2 
g  (l-r  ) 

2N 


(12) 


Note  that  the  estimates  are  (potentially)  much  more  accurate  when  the  magni¬ 
tude  of  the  correlation  coefficient  r  is  near  unity. 

It  can  be  shown  that  the  Cramer-Rao  bounds  on  the  errors  in  estimating 
<t>,  r,  g  in  the  constant  covariance  case  (SMI)  are  exactly  the  same  as  those 
just  presented  for  WCE.  This  means  that  the  fluctuations  in  scale  of  the 
covariance  matrix  do  not,  in  principle,  affect  the  accuracy  with  which  the 
elements  of  the  constant  part  can  be  estimated. 

* 

The  ML  estimates  of  these  parameters  are 


’1 


Airborne  MTI 

Up  to  this  point,  the  development  of  the  WCE  algorithm  has  been  kept 
as  general  as  possible.  It  will  subsequently  be  evaluated  and  compared  with 
SMI  for  a  problem  of  particular  interest,  radar  detection  of  slowly  moving 
ground  targets  from  a  moving  aircraft,  using  both  simulated  and  real  radar 
data.  This  section  discusses  this  problem  in  some  detail. 

The  motion  of  the  radar  beam  causes  ground  clutter  returns  to  have  a 
doppler  frequency  shift  which  depends  on  their  position  within  the  beam. 

If  the  radar  antenna  beam  is  fairly  broad  in  azimuth,  ground  clutter  returns 
from  some  part  of  the  beam  will  have  the  same  doppler  frequency  as  the  target 
returns  of  interest,  so  that  conventional  doppler  processing  of  returns  from  a 
single  antenna  cannot  separate  targets  from  clutter. 

One  solution  is  to  add  a  second  antenna  displaced  from  the  first  along 
the  direction  of  motion  of  the  platform,  so  that  the  two  antennas  occupy  the 
same  position  in  space  (or  nearly  so)  at  different  times.  Pulses  transmitted 
at  these  times  yield  nearly  identical  (i.e.,  highly  correlated)  returns  from 
stationary  objects,  while  returns  from  moving  objects  differ  because  of  the 
motion.  Subtraction  of  the  returns  received  by  the  two  antennas  thus  elimin¬ 
ates  most  of  the  ground  clutter,  leaving  only  returns  from  the  moving  objects. 

For  simplicity,  only  processing  of  pairs  of  returns,  one  from  each  an¬ 
tenna,  in  multiple  range  cells  was  simulated.  The  covariance  matrix  of  a  pair 
of  returns  from  the  nth  range  cell  is  shown  in  Appendix  A  to  be  of  the  form 


A  =  a  2C  = 
n  n 


The  signal  to  be  detected  is  of  the  form 


where  B  is  an  arbitrary  complex  amplitude.  An  optimum  processor  for  detecting 
this  signal  computes  |w  zj  and  compares  it  to  a  threshold,  where 


w  =  A  =  C  1s.  Thus 
—  n  —  — 


w  (<J> )  =  A  1s  (<)>)  =  C  1s  (♦)  = 
—  n  —  — 


C22  "C12e 


*  jdi 

— P  4- 

U12  6 


It  is  convenient  and  causes  no  degradation  to  scale  the  weighting  vector 


so  that  w^  =  1, 


— P  + 

12 

w2(*>  =  — - 5? 

C22“C12e  * 


C22~  C12 
C22~C12e^ 


and  detection  is  based  on  |z^  +  W2Z2^'  the  clutter  is  highly  correlated 


'22  >12 


i  2  n  d) 

C17|  <<1)  and  if  C^-C^  e  T  is  not  too  small,  i.e.,  if  the  signal  is 


sufficiently  different  in  doppler  from  the  clutter,  then 


W2  =  "F 


This  means  that  the  optimum  weighting  (13)  which  depends  on  the  unknown 
signal  phase  $ ,  can  be  approximated  by  one  which  depends  only  on  the  statistics 
of  the  clutter,  not  on  the  nature  of  the  desired  signal. 

The  output  signal-to-interference  ratio  (SIR)  of  a  linear  processor  with 
weighting  vector  w  is 


E  j  w  z  |  ‘ 


14 


Simulation  Results 

The  WCE  algorithm  was  compared  with  SMI  using  simulated  data.  The  basis 
of  comparison  was  the  cancellation  ratio  (15) . 

Simulated  data  were  generated  in  40  range  cells.  The  power  levels  (i.e., 
the  <^2)  in  the  cells  were  determined  by  selecting  at  random  40  numbers  be¬ 
tween  20  and  50  dB.  The  results  are  shown  in  Table  I.  For  each  range  cell, 
thirty-two  independent  pairs  of  complex  Gaussian  random  numbers  having  the 
covariance  matrix 

C  = 


were  then  generated  and  scaled  to  the  appropriate  power  level. 

The  SMI  algorithm  estimated  C  _  using  the  first  10  range  cells  and  used 

*  -1 

the  result  to  compute  z^  -c12  Z2  ln  40  cells*  The  input  and  output  inter- 

2  A  “*  1  2 

ference  powers,  |z^|  and  |z  -C^  Z2^  '  eacl1  of  the  40  cells  were  averaged 

over  the  32  independent  trials,  and  the  results  used  to  compute  an  empirical 
cancellation  ratio  for  each  cell.  These  ratios  were  plotted  as  functions  of 
the  input  interference  power. 

The  entire  process  was  repeated  using  the  adaptive  algorithm  proposed  in 
this  paper  on  the  same  data  set. 

Figure  1  shows  the  cancellation  ratios  produced  from  the  data  by  the  ap¬ 
proximately  optimum,  nonadaptive  processor  (14)  which  knows  a  priori  the 
interference  statistics.  The  empirical  cancellation  ratios  are  clustered 
about  the  theoretical  value  of  47  dB.  This  performance  represents  an  upper 
bound  for  the  adaptive  techniques. 
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The  same  data  processed  by  the  SMI  adaptive  algorithm  using  10  cells 
produced  the  results  shown  in  Figure  2.  The  cancellation  has  been  reduced  by 
about  3  dB  by  the  adaptivity.  Note,  however,  that  cancellation  was  consider¬ 
ably  improved  in  one  range  cell.  This  is  range  cell  6,  which  was  one  of  the 
cells  used  to  estimate  the  covariance,  and  has  the  largest  power  level  of 
these  10  cells.  It  tends  to  dominate  the  SMI  estimate,  which  weights  the  10 
cells  equally. 

Figure  3  shows  the  corresponding  result  for  the  WCE  algorithm,  also 
adapting  on  the  first  10  range  cells.  There  is  virtually  no  degradation  due 
to  adaptation.  In  view  of  the  increased  computation  required,  however,  this 
modest  increase  in  performance  is  not  deemed  significant.  The  real  advantage 
is  revealed  when  signals  are  present  in  the  learning  sample. 

To  this  end,  the  data  were  modified  by  the  addition  of  a  simulated 
target  having  4>  =  t  and  adjustable  amplitude  to  range  cell  5. 

Figures  4  a,b  show  the  effect  on  the  cancellation  performance  of  the  SMI 
and  WCE  algoritlims,  respectively,  of  a  20  dB  signal  present  in  cell  5.  The 
interference  power  in  cell  5  is  38.7  dB,  so  the  SIR  is  -18.7  dB.  SMI  per¬ 
formance  has  degraded  by  15  dB,  while  that  of  WCE  is  unaffected  (the  isolated 
point  is  that  for  cell  5) . 

Table  II  lists  the  weights  assigned  to  the  10  learning  cells  on  each  of 
the  32  independent  ti-ials  by  WCE.  Cell  5  is  consistently  deweighted  by  30  or 
40  dB  relative  to  the  other  cells.  It  is  therefore  not  surprising  that  the 
presence  of  the  signal  has  little  effect  on  the  covariance  estimate. 

The  computations  were  repeated  for  a  variety  of  signal  levels,  and  again 
using  only  the  first  5  range  cells  for  adaptation.  The  results  are  summarized 
in  Figure  5.  WCE  performance  is  not  affected  by  the  presence  of  a  signal,  no 
matter  how  strong.  SMI  performance  degrades  at  the  rate  of  about  0.9  dB  in 
cancellation  for  each  additional  1  dB  of  signal. 

The  performance  of  the  WCE  algorithm  begins  to  degrade  when  targets  are 
present  in  half  of  the  range  cells  used  for  covariance  estimation.  As  the 
number  of  targets  increases  beyond  this  point,  WCE  and  SMI  are  both  degraded 
by  about  the  same  amount. 
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Experimental  Results 

The  simulation  compared  the  performance  of  WCE  and  SMI  estimators  using 
data  which  obeyed  the  WCE  statistical  model.  It  was  to  be  expected,  therefore, 
that  the  WCE  estimates  would  be  better.  In  fact,  the  iteration  which  leads  to 
the  WCE  estimate  can  always  be  started  with  the  SMI  estimate,  and  subsequent 
iterations  can  only  increase  the  likelihood  ratio. 

The  most  important  advantage  of  WCE,  as  revealed  by  simulation,  is  its 
tendency  not  to  suppress  desired  signals  present  in  the  learning  sample.  It 
is  important  to  see  whether  or  not  this  advantage  is  maintained  when  using 
real  data  which  does  not  precisely  match  either  the  WCE  or  SMI  statistical 
models.  For  this  reason,  the  two  algorithms  were  compared  using  real  data 
from  an  airborne  L-band  MTI  radar  having  two  antennas.  Data  processing 
consists  of  adaptive  cancellation  of  each  pair  of  pulses  (one  from  each  an¬ 
tenna)  followed  by  Doppler  filtering  via  a  32  point  FFT,  on  each  of  40  range 
cells. 

In  most  cases,  the  two  algorithms  perform  equally  well.  However,  in  the 
example  to  be  presented  here,  a  moving  target  in  range  cell  15  (S/C  =  -12  dB) 
causes  a  significant  difference  in  performance.  The  cancellation  obtained 
with  WCE  using  range  cells  11-20  for  covariance  estimation  is  shown  in  Fig.  6. 
It  is  limited  by  the  thermal  noise  power  level  P^.  If  clutter  is  cancelled 
well  below  the  residual  noise  level,  the  expected  cancellation  ratio  is 

_  eK^2  _  pi 

e{zi-wz2|2  2PN 


since  |w|=  l,  so  that 


r  (dB)  =  P  (dB)  -P  (dB)  -3 
I  N 
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The  cancellation  ratios  should  thus  form  a  straight  line  with  a  slope  of  1 


when  plotted  against  the  interference  power  P  (dB) .  This  behavior  can  be  seen 
in  the  data.  The  Doppler-filtered  canceller  output  is  shown  in  Fig.  7. 

The  results  produced  by  SMI  using  this  data  are  shown  in  Figs.  8,  9. 
Cancellation  is  degraded  by  about  9  dB.  The  flatness  of  the  curve  indicates 
that  thermal  noise  is  no  longer  the  limiting  factor.  The  Doppler-filtered 
residues  show  that  the  detectability  of  small  moving  targets  has  been  reduced 
considerably.  The  difference  in  performance  would  probably  be  much  greater 
were  it  not  for  the  limit  imposed  on  WCE  by  thermal  noise. 

SUMMARY 

An  algorithm  for  generating  a  maximum  likelihood  estimate  of  the  covari¬ 
ance  matrix  of  a  vector  process,  using  multiple  observations,  has  been  de¬ 
veloped.  It  is  based  on  the  assumption  that  the  covariance  matrices  of  the 
observations  are  the  same  except  for  a  scale  factor.  The  resulting  estimate 
is  a  generalization  of  the  sample  covariance  matrix  in  that  it  is  a  weighted 
average  of  the  sample  covariance  matrices  of  the  individual  observations.  The 
weights  are  computed  via  an  iterative  procedure. 

Simulation  results  show  that  the  adaptive  "optimum"  filter  based  on  the 
weighted  estimate  offers  some  improvement  in  interference  rejection  compared 
to  that  based  on  the  sample  matrix  estimate.  The  main  advantage  of  the  weighted 
covariance  estimate,  however,  is  that  it  has  far  greater  immunity  to  errors 
caused  by  the  presence  of  desired  signals  in  the  observations  used  for  covari¬ 
ance  estimation.  It  is  therefore  valuable  in  situations  where  target-free 
interference  observations  cannot  be  guaranteed. 

The  major  disadvantage  of  the  technique  is  the  large  amount  of  computa¬ 
tion  required.  It  is  best  suited  to  problems  in  which  the  dimension  of  the 
vector  observations  is  small. 
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Fig.  3.  Adaptive  cancellation  using  WCE  (10  cells). 
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Fig.  8.  Adaptive  cancellation  of  real  radar  data  usinq 
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APPENDIX  A 


Derivation  of  Clutter  Covariance  Matrix  for 
Multiple  Antenna  Airborne  Radar 


Consider  an  airborne  radar  having  M  antennas  with  phase  centers  located 

at  positions  md  with  respect  to  some  common  reference  point,  moving  with 

constant  velocity  vector  v.  Let  J^<'be  a  vector  from  the  reference  point  to  an 

incremental  clutter  patch  at  coordinates  r,0  at  t  =  0.  The  range  vector  from 
fch 

the  m  antenna  to  this  patch  at  any  time  t  is  then  (Fig.  Al) 


R  (t)  =  R  -md  -tv 
-m  — o  —  — 


For  md  and  tv  sufficiently  small,  its  length  is  approximately 


1  A  • 

R  (t)  =  R  -  —  (R  md  +  tR  -v)  =  R  -md  +  R  t 
m  oR-o  —  -o—  o—o 

o 


(Al) 


« 

Note  that  R  ,d  and  R  are  all  functions  of  r  and  0. 
o  o 

Let  a(r,0)  denote  the  complex  amplitude  of  the  return  from  the  clutter 

fch 

patch  at  (r,0),  and  G  (r,6),  the  complex  gain  of  the  m  antenna  in  that 

XU 

direction.  Assume  that  the  m  antenna  transmits  a  pulse  at  time  t  . 

m 


s  (t)  =  Re{u(t-t  )  exp(j2wf  t) } 
m  mo 


It  is  assumed  that  only  the  transmitting  antenna  receives  the  resulting 

returns,  and  that  all  returns  are  received  before  the  next  transmission  from 

any  other  antenna  occurs.  With  these  assumptions,  the  total  clutter  return 
th  * 

received  by  the  m  antenna  is 


r  (t)  =  Re 
m 


I  //■ 


aG“  u(t-t  — t  (t) )  exp 
m  m 


j2irf  (t— t  (t) ) 
o 


drd6 


(A2) 


where  r(t)  is  the  delay  suffered  by  an  incremental  element  of  return  received 
at  time  t*4^.  The  behavior  of  T(t)  can  be  quite  complicated,  in  general,  but 

*The  (r,0)  dependence  of  a,  Gm,  and  -r(t)  has  been  suppressed  for  brevity. 
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using  the  linear  approximation  (Al)  it  can  be  shown  that  T(t)  is  adequately 
approximated  by 


R  , 

e„  =  —  t  (t)  =  — (R  -md  +  R  t)  =  t  +  t  t 
— R  R  Co  o  mo 

o 


where  d  =  £R*d,  Rq  =  e^*v.  Using  this  and  neglecting  the  effect  of  time 
compression  on  the  complex  modulation  u(t),  equation  (A2)  reduces  to 


r  (t)  =  Re 
m 


{ JIaCi “ 


(t-t  -T  )  exp  <  j 2irf  (t-T 
mm  l  o 


-x  t)>  drd0  > 

m  °  )  ) 


This  return  is  passed  through  a  filter  matched  to  the  transmitted  pulse  shape, 
yielding 


C  (t)  =  /  r  (t)  Re{u  (T-t)  exp  {j2irf  t}}dx 

m  J  m  o 


*  i Ee  [ff° 


CtG  x(t-t  -T  ,f  )  exp  {j2irf  (t  +t  )  } 
m  m  m  a  a  m  m 


exp  {-j27rf  t  }  drd0  exp  {j2v£  t) 
o  m  o 


where  the  Doppler  frequency  f .  =  -f  t  and  t  are  functions  of  (r,0)  and 

d  o  o  m 

X(t,f)  »  J u(x)u  (x-t)exp{ j2Trfx}dT 

is  the  ambiguity  function  of  the  radar  pulse  (see  [4],  p.  70).  The  complex 

* 

modulation  is  sampled  at  time  t  +  x",  yielding  the  complex  clutter  sample 

C  =  ffa G2  »f.)  exp{j2irf  (t  +  x  )}  exp{-j2irf  x  }drd0 

m  J  J  m  m  a  amm  om 

The  factor  1/2  has  been  absorbed  into  o. 


It  is  now  assumed  that  the  maximum  Doppler  frequency  of  the  clutter  is  much 
less  than  the  bandwidth  B  of  the  radar  pulse,  making  possible  the  approximation 


x(t,fd)  =  x(t,0) 

The  covariance  of  any  pair  of  clutter  samples  is  then  given  by 

E(C  C*)  =  /YpgV  X(t'-t  ,0)  /0)  (A3) 

m  n  jj  m  n  m  n 

exp  {j2irf.  (t  -t  )}  exp  {-j2ir(f  -f ,)  (t  -t  )  }drd0 
d  m  n  o  d  m  n 

It  will  be  assumed  that  the  transit  time  across  the  antenna  configuration 

(max  t  -r  )  is  much  less  than  the  reciprocal  of  the  radar  bandwidth;  the  first 
m  n 

m,n 

argument  of  both  ambiguity  functions  in  (A3)  can  then  be  replaced  by 
Their  product  then  becomes  | x (t'-t, 0) j 2 .  This  is  the  envelope  (squared)  of 
the  compressed  radar  pulse  and  is  an  impulse- like  function  of  r  and  therefore 
of  r.  The  remainder  of  the  integrand  varies  very  slowly  with  r  compared  to 
| X I  2 »  so  the  r  integration  merely  produces  a  constant  (/ 1 X I  2 dr)  which  can  be 
absorbed  into  P.  since  it  was  already  assumed  that  fd<<B,  the  last  approxi¬ 
mation  also  implies  that  f ,(t  -t  )  «  1,  so  that  this  phase  term  may  be 

d  m  n 

neglected.  Thus 

E(CCJ  =  f P(0)G^(6)G*  (0)  exp  /  j2ir[f  (t  -t  )-f  (T  -t  )]  \  d0 
m  n  J  m  n  ^  dmnomn) 

where  P,  G,  f ,,  and  t  -t  are  all  implicitly  functions  of  the  sampling  range 
d  m  n 

Ct  > 

R"  =  —  .  Their  variation  with  range  is  very  slow,  however,  except  for  P.  If 
the  clutter  samples  are  confined  to  a  range  interval  which  is  small  compared 
to  the  range,  and  if  the  clutter  power  distribution  is  the  same  in  each  range 
cell  except  for  a  scale  factor,  i.e.,  if 

P(R%0)  =  F(R  ')Q(Q) 
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then  the  covariance  matrix  of  the  clutter  samples  from  a  given  range  cell  is 
simply  a  constant  matrix  multiplied  by  a  scale  factor  which  is  a  function  of 
range. 


Making  the  substitutions 


t  =  mA 
m 


f ,  =  -f  t  =  —  e  -v 

d  o  o  A  — R  — 


f  (t  -t  )  =  -  7  e  'd(m-n) 
o  m  n  A  — R  — 


the  covariance  element  becomes 


E(C  C  )  =  F  (R")  /q(8)G2(8)G  (9)  exp/j2Tr~-  (m-n) (e  *vA  +  e  *d)|  d8 

mn  J  mn  1  A  — R  —  — R  —  ) 

(A4) 

The  effective  phase  center  location  is  defined  to  be  the  position  of  the 
th 

m  antenna  at  the  time  it  transmits.  If  adjacent  antennas  transmit  in  sequence, 

the  vector  motion  of  the  effective  phase  center  between  transmissions  is  then 

d  +  vA.  If  the  vectors  d  and  v  are  collinear,  this  motion  can  be  made  zero  by 

proper  choice  of  A.  If  the  antenna  patterns  are  identical  (G  =  G  ) ,  perfect 

m  n 

clutter  correlation  is  obtained.  More  generally,  if  the  antenna  patterns  are 

real,  as  is  usually  the  case,  maximum  correlation  is  obtained  by  choosing  A  to 

minimize  the  projection  of  d  +  vA  on  the  line  of  sight  e  at  beam  center. 
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APPENDIX  B 


Outline  of  Derivation  of  Information  Matrix 


The  elements  of  the  Fisher  information  matrix  are 


J..  =  E{^ 

i]  9a.  9a. 

i  9 


where  l  is  the  likelihood  ratio  (3)  and  the  a^'s  are  the  unknown  parameters. 
The  constant  part  of  the  covariance  matrix  C  is  parametrized  as  in (10) . 

Let  x  denote  any  one  of  these  real  parameters.  Then 


li  =  _  N  iM  +  y*_l  V1  c"1 

9x  | C  |  9x  Li  2  9x 


z 

— n 


(Bl) 


It  will  be  convenient  to  define 


1 

a  —n 
n 

These  are  complex  Gaussian  random  vectors  with  covariance  C  \  Using  the 

i  -•  [5] 

relation 


W-  WTHC-^. 


and  replacing  z^  by  t,  ,  expression  (Bl)  can  be  written 


Note  that 


so  that 


—  =  Y.U*  —  l  -Tr  {c_1  &■}) 
9x  9x  -^n  1  9 x  / 


EU*  )  =  Tr{E(c  £  )—}  =  Tr(C_1  -jp) 

^  3x  ^  •anan  9x  9x 


E(|f)  -  0 


(B2) 
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Let  y  denote  any  one  of  the  parameters  of  C.  Then 


/9£  3£\  _  „  .  *  3C  m  -1  3C,  «  *  3C  m  ,  -1  3C.  . 

\3x  3y  J  E  ^  ^  3x  ^ii  ‘  Tr(C  3x  £  (^n  3y  V  Tr(C  3y}  ’ 


-£ 


n  L 


t  ?*rL)  ~  Tr (c_1  |r)Tr(C 


-TT^tn  3y  •%' 


3x 


1  -c>] 
3y;J 


(B3) 


The  following  general  result  is  useful  in  evaluating  the  above  expression. 
Let  A,  B  denote  arbitrary  square  matrices  and  let  z  be  a  circular  complex 
Gaussian  vector  with  zero  mean  and  covariance  A.  Then 


*  * 

E(z  Az  z  Bz) 


E(ZZAi  jzi*  j>(KBk£V*) 
13 


k£ 


=  ????  Ai  jBk£E  (ziZjZkZ£) 

I3k£ 

The  expectation  can  be  evaluated  using  Reed's  results 


[6] 


=  EEEZ  A.  .B,  „  (A.  .  A  ,  +  A„  .  A  .,) 
. jka  13  k£  31  **  £i  3k 


=  (EEA  A  )(IZBkAk)  +EZ(EA  A  )(EBkA.) 
13  J  k£.  lk  3  J  £ 


=  Tr ( Aa) Tr (Ab)  +  Tr(AAAB) 


(B4) 


Use  of  this  result  in  (B3)  results  in 


,„,3£  3)^  x,  m  ,„“1  8C  -1  3C. 

***9?  =  NTr(C  37  C  3? 


(B5) 
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The  other  parameters  can  be  treated  in  a  similar  fashion . 


Using  (3) 


for  the  case  M  =  2, 


*  ~1 

o  z  C  z  ,  * 
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Again  note  that 
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-^2  (Tr(CC_1)-2)  =  0 


The  elements  of  the  information  matrix  corresponding  to  the  are 
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Use  of  (B4)  results  in 


E(_M__iL.)  =  [Tr (C-^CC_1C)  +  Tr(c"1C)Tr(C_1C)-4] 

3a  3a  a 
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6 

4  mn 


Finally,  for  the  cross  terms 
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Evaluation  of  (B5) ,  (B6),  (B7)  yields  the  information  matrix  J. 
vector  of  unknown  parameters  is  ordered 
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($,r,g,  o^f 
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the  result  is 


2  Tr(C 


(B7) 


If  the 
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2  -1 

where  b  =  (1  -r  ) 

The  phase  estimation  error  is  seen  to  be  independent  of  all  the  other  esti¬ 
mation  errors.  Its  variance  is  bounded  by 


The  remaining  portion  of  J  can  be  written  as 


J'  - 


(B8) 


where  D  is  diagonal  with  =  b,  d^ 


=  9 


-1 


{d 


kk 


=  a 


k-2 


k  =  3, 


N  +  2} 


B2xN 


-r  -r  ...  -r 


1  1  .  .  .  1 


and  I  is  the  NxN  identity  matrix.  Application  of  the  formula  for  the  inverse 
of  a  partitioned  matrix  to  (B8)  yields 
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where 


T  -1. 
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